Perm Krai
Comparative Analysis of Encoder-Based NER and Large Language Models for Skill Extraction from Russian Job Vacancies
Matkin, Nikita, Smirnov, Aleksei, Usanin, Mikhail, Ivanov, Egor, Sobyanin, Kirill, Paklina, Sofiia, Parshakov, Petr
The labor market is undergoing rapid changes, with increasing demands on job seekers and a surge in job openings. Identifying essential skills and competencies from job descriptions is challenging due to varying employer requirements and the omission of key skills. This study addresses these challenges by comparing traditional Named Entity Recognition (NER) methods based on encoders with Large Language Models (LLMs) for extracting skills from Russian job vacancies. Using a labeled dataset of 4,000 job vacancies for training and 1,472 for testing, the performance of both approaches is evaluated. Results indicate that traditional NER models, especially DeepPavlov RuBERT NER tuned, outperform LLMs across various metrics including accuracy, precision, recall, and inference time. The findings suggest that traditional NER models provide more effective and efficient solutions for skill extraction, enhancing job requirement clarity and aiding job seekers in aligning their qualifications with employer expectations. This research contributes to the field of natural language processing (NLP) and its application in the labor market, particularly in non-English contexts.
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)
Machine learning for reconstruction of polarity inversion lines from solar filaments
Solar filaments are well-known tracers of polarity inversion lines that separate two opposite magnetic polarities on the solar photosphere. Because observations of filaments began long before the systematic observations of solar magnetic fields, historical filament catalogs can facilitate the reconstruction of magnetic polarity maps at times when direct magnetic observations were not yet available. In practice, this reconstruction is often ambiguous and typically performed manually. We propose an automatic approach based on a machine-learning model that generates a variety of magnetic polarity maps consistent with filament observations. To evaluate the model and discuss the results we use the catalog of solar filaments and polarity maps compiled by McIntosh. We realize that the process of manual compilation of polarity maps includes not only information on filaments, but also a large amount of prior information, which is difficult to formalize. In order to compensate for the lack of prior knowledge for the machine-learning model, we provide it with polarity information at several reference points. We demonstrate that this process, which can be considered as the user-guided reconstruction or super-resolution, leads to polarity maps that are reasonably close to hand-drawn ones, and additionally allows for uncertainty estimation.
- North America > United States (0.14)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- Asia > Russia (0.04)
- (2 more...)
Dealing with Sparse Rewards Using Graph Neural Networks
Gerasyov, Matvey, Makarov, Ilya
Reinforcement learning is a machine learning paradigm where an artificial agent learns the optimal behavior through interactions with a dynamic environment. Goals and purposes are explained to the agent via a scalar reward signal it receives after each interaction. Throughout the training process, the agent infers the behavior that maximizes cumulative reward, also called the return. To succeed in this task, the agent needs to explore the environment to understand which states and actions yield high rewards. On the other hand, the agent also has to exploit the rewards it has already received to adapt its behavior. This problem is known as the exploration and exploitation trade-off. This work was supported in part on Section 2 by the Strategic Project "Digital Business" within the framework of the Strategic Academic Leadership Program "Priority 2030" at the National University of Science and Technology (NUST) MISiS, in part by the Basic Research Program at the National Research University Higher School of Economics (HSE University), and in part by the Computational Resources of HPC Facilities at HSE University.
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)
- Europe > Russia > Volga Federal District > Nizhny Novgorod Oblast > Nizhny Novgorod (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (7 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)
Delta-Closure Structure for Studying Data Distribution
Buzmakov, Aleksey, Makhalova, Tatiana, Kuznetsov, Sergei O., Napoli, Amedeo
In this paper, we revisit pattern mining and study the distribution underlying a binary dataset thanks to the closure structure which is based on passkeys, i.e., minimum generators in equivalence classes robust to noise. We introduce $\Delta$-closedness, a generalization of the closure operator, where $\Delta$ measures how a closed set differs from its upper neighbors in the partial order induced by closure. A $\Delta$-class of equivalence includes minimum and maximum elements and allows us to characterize the distribution underlying the data. Moreover, the set of $\Delta$-classes of equivalence can be partitioned into the so-called $\Delta$-closure structure. In particular, a $\Delta$-class of equivalence with a high level demonstrates correlations among many attributes, which are supported by more observations when $\Delta$ is large. In the experiments, we study the $\Delta$-closure structure of several real-world datasets and show that this structure is very stable for large $\Delta$ and does not substantially depend on the data sampling used for the analysis.
- Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)
- Asia > Russia (0.04)
- Europe > Russia > Volga Federal District > Perm Krai > Perm (0.04)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
On Interpretability and Similarity in Concept-Based Machine Learning
Kwuida, Léonard, Ignatov, Dmitry I.
Machine Learning (ML) provides important techniques for classification and predictions. Most of these are black-box models for users and do not provide decision-makers with an explanation. For the sake of transparency or more validity of decisions, the need to develop explainable/interpretable ML-methods is gaining more and more importance. Certain questions need to be addressed: How does an ML procedure derive the class for a particular entity? Why does a particular clustering emerge from a particular unsupervised ML procedure? What can we do if the number of attributes is very large? What are the possible reasons for the mistakes for concrete cases and models? For binary attributes, Formal Concept Analysis (FCA) offers techniques in terms of intents of formal concepts, and thus provides plausible reasons for model prediction. However, from the interpretable machine learning viewpoint, we still need to provide decision-makers with the importance of individual attributes to the classification of a particular object, which may facilitate explanations by experts in various domains with high-cost errors like medicine or finance. We discuss how notions from cooperative game theory can be used to assess the contribution of individual attributes in classification and clustering processes in concept-based machine learning. To address the 3rd question, we present some ideas on how to reduce the number of attributes using similarities in large contexts.
- Asia > Russia (0.14)
- Europe > Switzerland > Zürich > Zürich (0.14)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- (14 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
AKVIS Magnifier AI 10.0: Artificial Intelligence Technologies for Image Upscaling!
AKVIS announces the release of Magnifier AI 10.0! The new version uses artificial neural networks and machine learning groundbreaking image enlargement technologies. The update also offers full compatibility with macOS Catalina and Adobe 2020 and other changes. AKVIS Magnifier AI is efficient image resizing software. It allows blowing up images into supersize prints without loss in quality.
AKVIS Magnifier AI 10.0: Artificial Intelligence Technologies for Image Upscaling!
AKVIS announces the release of Magnifier AI 10.0! The new version uses artificial neural networks and machine learning groundbreaking image enlargement technologies. The update also offers full compatibility with macOS Catalina and Adobe 2020 and other changes. AKVIS Magnifier AI is efficient image resizing software. It allows blowing up images into supersize prints without loss in quality.
Multi-Task Multicriteria Hyperparameter Optimization
Akhmetzyanov, Kirill, Yuzhakov, Alexander
We present a new method for searching optimal hyperparameters among several tasks and several criteria. Multi-Task Multi Criteria method (MTMC) provides several Pareto-optimal solutions, among which one solution is selected with given criteria significance coefficients. The article begins with a mathematical formulation of the problem of choosing optimal hyperparameters. Then, the steps of the MTMC method that solves this problem are described. The proposed method is evaluated on the image classification problem using a convolutional neural network. The article presents optimal hyperparameters for various criteria significance coefficients.
Question Embeddings Based on Shannon Entropy: Solving intent classification task in goal-oriented dialogue system
Perevalov, Aleksandr, Kurushin, Daniil, Faizrakhmanov, Rustam, Khabibrakhmanova, Farida
Question-answering systems and voice assistants are becoming major part of client service departments of many organizations, helping them to reduce the labor costs of staff. In many such systems, there is always natural language understanding module that solves intent classification task. This task is complicated because of its case-dependency - every subject area has its own semantic kernel. The state of art approaches for intent classification are different machine learning and deep learning methods that use text vector representations as input. The basic vector representation models such as Bag of words and TF-IDF generate sparse matrixes, which are becoming very big as the amount of input data grows. Modern methods such as word2vec and FastText use neural networks to evaluate word embeddings with fixed dimension size. As we are developing a question-answering system for students and enrollees of the Perm National Research Polytechnic University, we have faced the problem of user's intent detection. The subject area of our system is very specific, that is why there is a lack of training data. This aspect makes intent classification task more challenging for using state of the art deep learning methods. In this paper, we propose an approach of the questions embeddings representation based on calculation of Shannon entropy.The goal of the approach is to produce low dimensional question vectors as neural approaches do and to outperform related methods, described above in condition of small dataset. We evaluate and compare our model with existing ones using logistic regression and dataset that contains questions asked by students and enrollees. The data is labeled into six classes. Experimental comparison of proposed approach and other models revealed that proposed model performed better in the given task.
Robot in Russiai nterrupts Putin to introduce itself
President Vladimir Putin was interrupted by a robot in a bizarre incident at a Russian technology show. The leader shook hands with a robot during a visit to a technology firm after it recognised him and introduced itself. Putin, 64, was photographed shaking hands with the black and white robot during a visit to a technology firm in the city of Perm in central Russia's Perm Krai region. While the Russian leader was talking to the machine's creator, the robot recognised Putin's face and said hello The robot features a facial recognition feature and saw the Russian President. It then interrupted his creator who was speaking to Putin and introduced itself. The robot said: 'Hello Vladimir Vladimirovich [Putin's full name, used to address him politely].
- Asia > Russia (1.00)
- Europe > Russia > Volga Federal District > Perm Krai (0.29)
- Government > Regional Government > Europe Government > Russia Government (0.97)
- Government > Regional Government > Asia Government > Russia Government (0.97)